# Long Context Understanding
Internvl3 38B Instruct GGUF
Apache-2.0
InternVL3-38B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.
Image-to-Text
Transformers

unsloth
1,236
2
Internvl3 8B Instruct GGUF
Apache-2.0
InternVL3-8B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.
Image-to-Text
Transformers

unsloth
2,412
1
Internvl3 14B Instruct GGUF
Apache-2.0
InternVL3-14B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional multimodal perception and reasoning capabilities, supporting various tasks such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
Image-to-Text
Transformers

unsloth
982
1
Qwen3 14B Base Unsloth Bnb 4bit
Apache-2.0
Qwen3-14B-Base is the latest generation large language model in the Qwen series, offering a dense model with 14.8 billion parameters, supporting a context length of 32k and covering 119 languages.
Large Language Model
Transformers

unsloth
2,120
1
Internvl3 2B AWQ
Other
InternVL3-2B is an advanced Multimodal Large Language Model (MLLM) developed by OpenGVLab, featuring exceptional multimodal perception and reasoning capabilities, supporting tool usage, GUI agents, industrial image analysis, 3D visual perception, and more.

OpenGVLab
677
1
Internvl3 9B Instruct
MIT
InternVL3-9B-Instruct is the supervised fine-tuned version of the InternVL3 series, featuring powerful multimodal perception and reasoning capabilities, supporting various modalities such as images, text, and videos.
Image-to-Text
Transformers Other

OpenGVLab
220
2
Internvl3 8B Instruct
Other
InternVL3-8B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional multimodal perception and reasoning capabilities, supporting various functionalities such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
Image-to-Text
Transformers Other

OpenGVLab
885
2
Internvl3 1B Instruct
Apache-2.0
InternVL3-1B-Instruct is the supervised fine-tuned version of the InternVL3 series, based on native multimodal pretraining, with exceptional multimodal perception and reasoning capabilities.
Image-to-Text
Transformers Other

OpenGVLab
705
5
Internvl3 1B
Other
InternVL3-1B is a 1B-parameter multimodal large language model in the InternVL3 series, integrating the InternViT visual encoder and Qwen2.5 language model, with exceptional multimodal perception and reasoning capabilities.

FriendliAI
71
1
Yi 1.5 34B
Apache-2.0
Yi-1.5 is an upgraded version of the Yi model, with improved performance in programming, mathematics, reasoning, and instruction following.
Large Language Model
Transformers

01-ai
404
48
Featured Recommended AI Models